Parameter Differentiation Based Multilingual Neural Machine Translation
نویسندگان
چکیده
Multilingual neural machine translation (MNMT) aims to translate multiple languages with a single model and has been proved successful thanks effective knowledge transfer among different shared parameters. However, it is still an open question which parameters should be ones need task-specific. Currently, the common practice heuristically design or search language-specific modules, difficult find optimal configuration. In this paper, we propose novel parameter differentiation based method that allows determine language-specific during training. Inspired by cellular differentiation, each in our can dynamically differentiate into more specialized types. We further define criterion as inter-task gradient similarity. Therefore, conflicting gradients are likely language-specific. Extensive experiments on multilingual datasets have demonstrated significantly outperforms various strong baselines sharing configurations. Further analysis reveals configuration obtained correlates well linguistic proximities.
منابع مشابه
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
We propose a simple solution to use a single Neural Machine Translation (NMT) model to translate between multiple languages. Our solution requires no changes to the model architecture from a standard NMT system but instead introduces an artificial token at the beginning of the input sentence to specify the required target language. The rest of the model, which includes an encoder, decoder and a...
متن کاملApplications in Multilingual Machine Translation Applications in Multilingual Machine Translation
The CAT2 Machine Translation System, developed in Saarbr ucken in 1987, is a natural language application coded entirely in Prolog. Since its initial development, several languages have been implemented on an experimental basis to evaluate the translation methodology, the underlying formalism, the linguistic descriptions, and the e ectiveness of the Prolog implementation. Seven years later, it...
متن کاملMultilingual Neural Machine Translation for Low Resource Languages
Neural Machine Translation (NMT) has been shown to be more effective in translation tasks compared to the Phrase-Based Statistical Machine Translation (PBMT). However, NMT systems are limited in translating low-resource languages (LRL), due to the fact that neural methods require a large amount of parallel data to learn effective mappings between languages. In this work we show how so-called mu...
متن کاملToward Multilingual Neural Machine Translation with Universal Encoder and Decoder
In this paper, we present our first attempts in building a multilingual Neural Machine Translation framework under a unified approach in which the information shared among languages can be helpful in the translation of individual language pairs. We are then able to employ attention-based Neural Machine Translation for many-to-many multilingual translation tasks. Our approach does not require an...
متن کاملLearning Joint Multilingual Sentence Representations with Neural Machine Translation
In this paper, we use the framework of neural machine translation to learn joint sentence representations across six very different languages. Our aim is that a representation which is independent of the language, is likely to capture the underlying semantics. We define a new crosslingual similarity measure, compare up to 1.4M sentence representations and study the characteristics of close sent...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i10.21396